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6 



ENTERED 



<110> APPLICANT: CuraGen Corporation 
Vernet, Corine A.M. 
Fernandes, Elma R 
Taupier, Raymond J 
Quinn, Kerry E 
Spytek, Kimberly A 
Rastelli, Luca 
Herrman, John L 

<120> TITLE OF INVENTION: Novel Proteins and Nucleic Acids Encoding Same 
<130> FILE REFERENCE: 15966-750 

<140> CURRENT APPLICATION NUMBER: 09/825, 751A 
<141> CURRENT FILING DATE: 2001-04-30 

<150> PRIOR APPLICATION NUMBER: 60/194,314 

<151> PRIOR FILING DATE: 2000-04-03 

<150> PRIOR APPLICATION NUMBER: 60/225,693 

<151> PRIOR FILING DATE: 2000-08-16 

<160> NUMBER OF SEQ ID NOS : 85 

<170> SOFTWARE: Patentln Ver . 2.1 

<210> SEQ ID NO: 1 

<211> LENGTH: 1852 

<212> TYPE: DNA 



<213> ORGANISM 
<2 20> FEATURE: 
<221> NAME/KEY 
<222> LOCATION 
<4 00> SEQUENCE 



Homo sapiens 



CDS 

(208) . . (1698) 
1 

cggatgactc ccgagaaggt gagcccctca cccacatgct aagagcccct tctgggccac 60 

ccagatccat ctccgcactg cctgggtctc tgagtttcag gctccccctg agagcctggg 120 

tggccctgga ccctgccagc ctggggcttg ggcttttgtc cccttggggc cttgagtgtg 180 

4 5 gccagggctc tggcgattgt gtggtga cag aag cca tgt ctg caa cgc ctg cca 2 34 



46 



Gin Lys Pro Cys Leu Gin Arg Leu Pro 



47 


















i 








5 










49 


tec 


gca 


gac 


gtg 


aat 


gag 


tgt 


gca 


gag 


aac 


cct 


ggc 


gtc 


tgc 


act 


aac 


282 


50 


Ser 


Ala 


Asp 


Val 


Asn 


Glu 


Cys 


Ala 


Glu 


Asn 


Pro 


Gly 


Val 


Cys 


Thr 


Asn 




51 


10 










15 










20 










25 




53 


ggc 


gtc 


tgt 


gtc 


aac 


acc 


gat 


gga 


tec 


ttc 


cgc 


tgt 


gag 


tgt 


ccc 


ttt 


330 


54 


Gly 


Val 


Cys 


Val 


Asn 


Thr 


Asp 


Gly 


Ser 


Phe 


Arg 


Cys 


Glu 


Cys 


Pro 


Phe 




55 










30 










35 










40 






57 


ggc 


tac 


age 


ctg 


gac 


ttc 


act 


ggc 


ate 


aac 


tgt 


gtg 


gac 


aca 


gac 


gag 


378 


58 


Gly 


Tyr 


Ser 


Leu 


Asp 


Phe 


Thr 


Gly 


He 


Asn 


Cys 


Val 


Asp 


Thr 


Asp 


Glu 




59 








4 5 










50 










55 








61 


tgc 


tct 


gtc 


ggc 


cac 


ccc 


tgt 


ggg 


caa 


ggg 


aca 


tgc 


acc 


aat 


gtc 


ate 


426 


62 


Cys 


Ser 


Val 


Gly 


His 


Pro 


Cys 


Gly 


Gin 


Gly 


Thr 


Cys 


Thr 


Asn 


Val 


He 
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63 60 65 70 

65 gga ggc ttc gaa tgt gcc tgt get gac ggc ttt gag cct ggc etc atg 474 

66 Gly Gly Phe Glu Cys Ala Cys Ala Asp Gly Phe Glu Pro Gly Leu Met 

67 75 80 85 

69 atg acc tgc gag gac ate gac gaa tgc tec ctg aac ccg ctg etc tgt 522 

70 Met Thr Cys Glu Asp lie Asp Glu Cys Ser Leu Asn Pro Leu Leu Cys 

71 90 95 100 105 

73 gcc ttc cgc tgc cac aat acc gag ggc tec tac ctg tgc acc tgt cca 570 

74 Ala Phe Arg Cys His Asn Thr Glu Gly Ser Tyr Leu Cys Thr Cys Pro 

75 110 115 120 

77 gcc ggc tac acc ctg egg gag gac ggg gcc atg tgt cga gat gtg gac 618 

78 Ala Gly Tyr Thr Leu Arg Glu Asp Gly Ala Met Cys Arg Asp Val Asp 

79 125 130 135 

81 gag tgt gca gat ggt cag cag gac tgc cac gcc egg ggc atg gag tgc 666 

82 Glu Cys Ala Asp Gly Gin Gin Asp Cys His Ala Arg Gly Met Glu Cys 

83 140 145 150 

85 aag aac etc ate ggt acc ttc gcg tgc gtc tgt ccc cca ggc atg egg 714 

86 Lys Asn Leu lie Gly Thr Phe Ala Cys Val Cys Pro Pro Gly Met Arg 

87 155 160 165 

89 ccc ctg cct ggc tct ggg gag ggc tgc aca gat gac aat gaa tgc cac 762 

90 Pro Leu Pro Gly Ser Gly Glu Gly Cys Thr Asp Asp Asn Glu Cys His 

91 170 175 180 185 

93 get cag cct gac etc tgt gtc aac ggc cgc tgt gtc aac acc gcg ggc 810 

94 Ala Gin Pro Asp Leu Cys Val Asn Gly Arg Cys Val Asn Thr Ala Gly 

95 190 195 . 200 

97 age ttc egg tgc gac tgt gat gag gga ttc cag ccc age ccc acc ctt 858 

98 Ser Phe Arg Cys Asp Cys Asp Glu Gly Phe Gin Pro Ser Pro Thr Leu 

99 205 . 210 215 . 

101 acc gag tgc cac gac ate egg cag ggg ccc tgc ttt gcc gag gtg ctg 906 

102 Thr Glu Cys His Asp lie Arg Gin Gly Pro Cys Phe Ala Glu Val Leu 

103 220 225 230 

105 cag acc atg tgc egg tct ctg tec age age agt gag get gtc acc agg 954 
10 6 Gin Thr Met Cys Arg Ser Leu Ser Ser Ser Ser Glu Ala Val Thr Arg 
107 235 240 245 

109 gcc gag tgc tgc tgt ggg ggt ggc egg ggc tgg ggg ccc cgc tgc gag 1002 

110 Ala Glu Cys Cys Cys Gly Gly Gly Arg Gly Trp Gly Pro Arg Cys Glu 

111 250 255 260 265 

113 etc tgt ccc ctg ccc ggc acc tct gcc tac agg aag ctg tgc ccc cat 1050 

114 Leu Cys Pro Leu Pro Gly Thr Ser Ala Tyr Arg Lys Leu Cys Pro His 

115 270 275 280 

117 ggc tea ggc tac act get gag ggc cga gat gta gat gaa tgc cgt atg 1098 

118 Gly Ser Gly Tyr Thr Ala Glu Gly Arg Asp Val Asp Glu Cys Arg Met 

119 285 290 295 

121 ctt get cac ctg tgt get cat ggg gag tgc ate aac age ctt ggc tec 1146 

122 Leu Ala His Leu Cys Ala His Gly Glu Cys lie Asn Ser Leu Gly Ser 

123 300 305 310 

125 ttc cgc tgc cac tgt cag gcc ggg tac aca ccg gat get act get act 1194 
12 6 Phe Arg Cys His Cys Gin Ala Gly Tyr Thr Pro Asp Ala Thr Ala Thr 
127 315 320 325 
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129 


acc 


tgc 


ctcr 


crat 


atcr 


gat 


cracr 


tcrc 


age 


cag 


crtc 


ccc 


aag 


cca 


tgt 


acc 


1242 


130 


Thr 


Cys 


Leu 


Asp 


Met 


Asp 


Glu 


Cys 


Ser 


. Gin 


Val 


Pro 


Lys 


Pro 


Cys 


Thr 




1 11 


j -j \j 










335 










340 










345 




1 11 




etc 


+■ err 1 
uy <~ 


aaa 


aac 


acg 


aag 


rrrr p 

y y 


agt 


ttc 


*~ uy 


+- rr p 

uy o 


age 


tat 
uy u 


ccc 


cga 


1290 


134 


Phe 


Leu 


Cys 


Lys 


Asn 


Thr 


Lys 


Glv 


Ser 


Phe 


Leu 


Cys 


Ser 


Cys 


Pro 


Arg 




135 










350 










355 










360 






137 


cr err 


tac 


ct* cr 
^ uy 


ctg 


era cr 

y 01 y 


era cr 

y a y 


gat 


rrrrp 

yy v - 


a crcr 
cay y 


acc 


1" (TP 

L.y u. 


aaa 


gac 


ft cr 

u. uy 


ga c 


gaa 


1338 


138 


Glv 


Tvr 

JL 


Leu 


Leu 


Glu 


Glu 


Asp 


Glv 


Arcr 


Thr 


Cys 


Lys 


Asp 


Leu 


Asp 


Glu 




139 








365 










370 










375 








1 A 1 

X ** X 




a L- L. 


+■ p p 


PfTfT 

egg 


p^ rr 
Lay 


o d u- 


o ^ /~< 


T-rrT- 

T_gi_ 


p^ rr 


U UU- 


pi - p 

U- LL< 




rr +- /-i 


ClO. U- 


3 pt 

a u- u 


rr i - rr 

g ug 


1 IRfi 

X O O D 


142 


Cys 


Thr 


Ser 


Arcr 

«.x y 


Gin 


His 


Asn 


Cys 


Gin 


Phe 


Leu 


Cys 


Val 


Asn 


Thr 


Val 




14 3 






380 










385 










390 










us 

It J 


rrrrp 
y y o 


gec 


ttc 


a c c 


t* rrp 
L.y o 


cgc 


uy u 


cea 


cc c 


CTCT P 

yy° 


ttc 


ace 


ca g 


cac 


cac 


cag 


14 34 


146 


Glv 

_i_ jr 


Ala 


Phe 


Thr 


Cys 


Arcr 


Cys 


Pro 


Pro 


Glv 


Phe 


Thr 


Gin 


His 


His 


Gin 




147 




395 










400 










405 












14 9 




"H rr p 


ttc 


ga c 


aat 


gat 


y ay 


uy u- 


tea 


gec 


cag 


c c t 


rrrrp 
y y u- 


cca 


tcrt 
uy u 


rrrr1~ 
y y u 


1482 


150 


Ala 


Cys 


Phe 


Asp 


Asn 


Asp 


Glu 


Cys 


Ser 


Ala 


Gin 


Pro 


Glv 


Pro 


Cys 


Glv 




151 


410 










415 










420 










425 




1 si 

X -J J 




ca c 


CTCTCT 

y y y 


ca c 


uy e 




aac 


a cc 


ccg 


rrrr p 
y y u 


^ rrp 
ciy 


U L. U» 


cgc 


i_y u 


gaa 


T~ (TP 

uy o 


1530 


154 


Ala 


His 


Glv 

JL 


His 


Cys 


His 


Asn 


Thr 


Pro 


Gly 


Ser 


Phe 


Arg 


Cys 


Glu 


Cys 




155 










4 30 










435 










440 






157 


ca c 


caa 


y y ^ 


ttc 


ace 


u- uy 


gtc 


age 


tea 


ggc 


cat 


ggc 


tgt 


gaa 


gat 


gtg 


1578 


158 


His 


Gin 


Gly 


Phe 


Thr 


Leu 


Val 


Ser 


Ser 


Gly 


His 


Gly 


Cys 


Glu 


Asp 


Val 




159 








445 










450 










455 








161 


aat 


gaa 


tgt 


gat 


ggg 


ccc 


cac 


cgc 


tgc 


cag 


cat 


ggc 


tgt 


cag 


aac 


cag 


1626 


X D Zi 


Asn 


Glu 


Cys 


Asp 


Gly 


Pro 


His 


Arg 


Cys 


Gin 


His 


Gly 


Cys 


Gin 


Asn 


Gin 




1 1 

X D O 






460 










465 










470 










165 


eta 


ggg 


ggc 


tac 


cgc 


tgc 


age 


tgc 


ccc 


cag 


ggt 


ttc 


acc 


cag 


cac 


tec 


1674 


16 6 


Leu 


Gly 


Gly 


Tyr 


Arg 


Cys 


Ser 


Cys 


Pro 


Gin 


Gly 


Phe 


Thr 


Gin 


His 


Ser 




167 




475 










480 










485 












169 


cag 


tgg 


gec 


cag 


tgt 


gtg 


ggt gag 


tgaaaagggc tgggaagaag ctgggccctc 


1728 


1 70 

X / \J 


Gin 


Trp 


Ala 


Gin 


Cys 


Val 


Gly 


Glu 




















171 


490 










495 
























X / o 


caccagaatc tgetcagage aggegactaa 


l cagacgccac 


ectgeaagat gatgtgacaa 


1 7RR 

X / o o 


175 


gcacaattat ctaaagattg aacaggecag cccagaagat 


gagaatgagt gtgccctgtc 


184 8 


177 


gece 






























1 RS? 

X O J -£i 


180 


<210> SEQ ID NO: 


: 2 


























1 ftl 

X O X 


<211> LENGTH: 497 


























182 


<212> TYPE: 


PRT 




























183 


<213> ORGANISM: 


Homo sapiens 




















185 


<4 00> SEQUENCE: 


2 


























186 


Gin 


Lys 


Pro 


Cys 


Leu 


Gin 


Arg 


Leu 


Pro 


Ser 


Ala 


Asp 


Val 


Asn 


Glu 


Cys 




187 


1- 








5 










10 










15 






189 


Ala 


Glu 


Asn 


Pro 


Gly 


Val 


Cys 


Thr 


Asn 


Gly Val 


Cys 


Val 


Asn 


Thr 


Asp 




190 








20 










25 










30 








192 


Gly 


Ser 


Phe 


Arg 


Cys 


Glu 


Cys 


Pro 


Phe 


Gly 


Tyr 


Ser 


Leu 


Asp 


Phe 


Thr 




193 






35 










40 










45 










195 


Gly 


He 


Asn 


Cys 


Val 


Asp 


Thr 


Asp 


Glu 


Cys 


Ser 


Val 


Gly 


His 


Pro 


Cys 




196 




50 










55 










60 
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198 Gly Gin Gly Thr Cys Thr Asn Val lie Gly Gly Phe Glu Cys Ala Cys 

199 65 70 75 80 

201 Ala Asp Gly Phe Glu Pro Gly Leu Met Met Thr Cys Glu Asp lie Asp 

202 85 90 95 

204 Glu Cys Ser Leu Asn Pro Leu Leu Cys Ala Phe Arg Cys His Asn Thr 

205 100 105 110 

207 Glu Gly Ser Tyr Leu Cys Thr Cys Pro Ala Gly Tyr Thr Leu Arg Glu 

208 115 120 125 

210 Asp Gly Ala Met Cys Arg Asp Val Asp Glu Cys Ala Asp Gly Gin Gin 

211 130 135 140 

213 Asp Cys His Ala Arg Gly Met Glu Cys. Lys Asn Leu lie Gly Thr Phe 

214 145 150 155 160 

216 Ala Cys Val Cys Pro Pro Gly Met Arg Pro Leu Pro Gly Ser Gly Glu 

217 165 170 175 

219 Gly Cys Thr Asp Asp Asn Glu Cys His Ala Gin Pro Asp Leu Cys Val 

220 180 185 190 

222 Asn Gly Arg Cys Val Asn Thr Ala Gly Ser Phe Arg Cys Asp Cys Asp 

223 195 200 205 

225 Glu Gly Phe Gin Pro Ser Pro Thr Leu Thr Glu Cys His Asp lie Arg 

226 210 215 220 

228 Gin Gly Pro Cys Phe Ala Glu Val Leu Gin Thr Met Cys Arg Ser Leu 

229 225 230 235 240 

231 Ser Ser Ser Ser Glu Ala Val Thr Arg Ala Glu Cys Cys Cys Gly Gly 

232 245 250 255 

2 34 Gly Arg Gly Trp Gly Pro Arg Cys Glu Leu Cys Pro Leu Pro Gly Thr 
235 260 265 ' 270 

2 37 Ser Ala Tyr Arg Lys Leu Cys Pro His Gly Ser Gly Tyr Thr Ala Glu 
238 275 280 285 

240 Gly Arg Asp Val Asp Glu Cys Arg Met Leu Ala His Leu Cys Ala His 

241 290 295 300 

24 3 Gly Glu Cys lie Asn Ser Leu Gly Ser Phe Arg Cys His Cys Gin Ala 
244 305 310 315 320 

246 Gly Tyr Thr Pro Asp Ala Thr Ala Thr Thr Cys Leu Asp Met Asp Glu 

247 325 330 335 . 

249 Cys Ser Gin Val Pro Lys Pro Cys Thr Phe Leu Cys Lys Asn Thr Lys 

250 340 345 350 

2 52 Gly Ser Phe Leu Cys Ser Cys Pro Arg Gly Tyr Leu Leu Glu Glu Asp 
253 355 360 365 

255 Gly Arg Thr Cys Lys Asp Leu Asp Glu Cys Thr Ser Arg Gin His Asn 

256 370 375 380 

2 58 Cys Gin Phe Leu Cys Val Asn Thr Val Gly Ala Phe Thr Cys Arg Cys 
259 385 390 395 400 

261 Pro Pro Gly Phe Thr Gin His His Gin Ala Cys Phe Asp Asn Asp Glu 

262 405 410 415 

264 Cys Ser Ala Gin Pro Gly Pro Cys Gly Ala His Gly His Cys His Asn 

265 420 425 430 

267 Thr Pro Gly Ser Phe Arg Cys Glu Cys His Gin Gly Phe Thr Leu Val 

268 435 440 445 

270 Ser Ser Gly His Gly Cys Glu Asp Val Asn Glu Cys Asp Gly Pro His 
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0 71 
Z / 1 


450 






455 










460 










O "J "3 


Arg Cys Gin His 


Gly 


Cys 


Gin 


Asn 


C~\ n 

VjXll 


j_ieu 


ri "1 v 




xyx 




Cys 


Ser 


A /4 


465 




470 










A 7 R 










480 


Z / b 


Cys Pro Gin Gly 


Phe 


Thr 


Gin 


His 


C £"» >- 


n n 


irp 




Vj X 11 


y o 


Val 


Gly 


277 




485 










a q n 
4 y v 










H O 




2 79 


Glu 


























z o4 


<210> SEQ ID NO 


3 
























285 


<211> LENGTH: 379 
























2 86 


<zlz> TYPE : DNA 


























*1 O T 

2 8/ 


<213> ORGANISM: 


Homo sapiens 


















289 


<2zU> FEATURE: 


























290 


<221> NAME/KEY: 


CDS 
























^ O 1 

2 91 


<222> LOCATION: 


(1) 


. (378) 




















2 93 


<400> SEQUENCE: 


3 
























2 94 


gga ggg cct gtg 


atx 


eta 


ctg 


cag 


gca 


ggc 


acc 


ccc 


cae 


aac 


etc 


a f" a 
aOa 


a y d 


Gly Gly Pro Val 


lie 


Leu 


Leu 


Gin 


Ala 


(Z "1 \r 


Thr* 

X 111. 






Aon 
noil 


Leu 


Thr 


O Q c 

zy b 


1 


5 










1 0 










X — ' 




298 


tgc egg gec ttc 


aat 


gcg 


aag 


cct 


get 


gec 


acc 


dtC 


dLC 


tgg 




egg 


O Q Q 

z y y 


Cys Arg Ala Phe 


Asn 


Ala 


Lys 


Pro 


H. J_ cl 


Ala 

rV la 


X 111 


lie 


Tl p 
X X c 


xrp 


Phe 


Arg 


300 












z D 
















302 


gac ggg acg cag 


cag 


gag 


ggc 


get 


gtg 


gee 


age 


acg 


gaa 


+■ +■ rr 

rrg 


ctg 


aag 


jU J 


Asp Gly Thr Gin 


Gin 


Glu 


Gly 


Ala 


Val 




C o >~ 

ocl 


T* V-> y 
X 11X 


VjX u 


Leu 


Leu 


Lys 


304 


35 








40 










ft D 








306 


gat ggg aag agg 


gag 


acc 


acc 


gtg 


age 


caa 


ctg 


r*+- +■ 

ctu 


a. L. U 


aac 


ccc 


acg 


q n 7 


Asp Gly Lys Arg 


Glu 


Thr 


Thr 


Val 


Cor 


ol 11 


Leu 


Leu 


X xc 


Aon 
noli 


Prn 


Thr 


308 


50 






55 










D U 










310 


gac ctg gac ata 


ggg 


cgt 


gtc 


ttc 


-l y-1 4- 

acu 


tgc 


cga 


age 


aLy 


aac 


gaa 


gee 


311 


Asp Leu Asp lie 


Gly 


Arg 


Val 


Phe 


inr 


Cys 


Arg 




ne u 


Asn 


C 1 n 
ul Ll 


A 1 o 
nla 


312 


65 




70 






















314 


ate cct agt ggc 


aag 


gag 


act 


tec 


arc 


gag 


ctg 


ga l 


gtg 


cae 


cae 


cct 


315 


lie Pro Ser Gly 


Lys 


Glu 


Thr 


Ser 


lie 


bill 


Leu 


Asp 


Val 


nis 


fll o 


It X vj 


QIC 




85 










on 














318 


cct aca gtg acc 


ctg 


tec 


att 


gag 


cca 


cag 


acg 


ggg 


cag 


gag 


ggt 


gag 


319 


Pro Thr Val Thr 


Leu 


Ser 


He 


Glu 


Pro 


QiXn 


Thr 


Cjiy 


Gin 


VjIU 


(jiy 


blu 


320 


100 










1 AC 

JLUD 










i i n 
1 1 u 






322 


cgt gtt gtc ttt 


acc 


tgc 


cag 


gec 


aca 


gec 


aac 


ccc 


gag 


ate 


4- 




in 
j z o 


Arg Val Val Phe 


Thr 


Cys 


Gin 


Ala 


Thr 


Ala 


Asn 


Pro 




lie 






324 


115 








120 










i n c 
IZD 








327 


<210> SEQ ID NO 


: 4 
























328 


<211> LENGTH: 126 
























329 


<212> TYPE: PRT 


























330 


<213> ORGANISM: 


Homo sapiens 


















332 


<400> SEQUENCE: 


4 
























333 


Gly Gly Pro Val 


He 


Leu 


Leu 


Gin 


Ala 


Gly 


Thr 


Pro 


His 


Asn 


Leu 


Thr 


334 


1 


5 










10 










15 




336 


Cys Arg Ala Phe 


Asn 


Ala 


Lys 


Pro 


Ala 


Ala 


Thr 


He 


He 


Trp 


Phe 


Arg 


337 


20 










25 










30 






339 


Asp Gly Thr Gin 


Gin 


Glu 


Gly 


Ala 


val 


Ala 


Ser 


Thr 


Glu 


Leu 


Leu 


Lys 


340 


35 








40 










45 









48 



96 



144 



192 



240 



288 



336 



379 



IT* of n and/or Xaa has been detected in the Sequence Listin 
each sequence using n or Xaa. 
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L:17 M:271 C: Current Filing Date differs, Replaced Current Filing Date 
L:2236 M:341 W: (46) n n" or "Xaa" used, for SEQ ID# : 57 
L:2237 M:341 W: (46) n n" or "Xaa" used, for SEQ ID#:57 
L:2291 M:341 W: (46) n n" or "Xaa" used, for SEQ ID#:61 
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